NTT/NAIST's Text Summarization Systems for TSC-2
Authors
Abstract
In this paper, we describe two approaches to summarization: (1) sentence extraction only, and (2) sentence extraction combined with bunsetsu elimination. Both approaches use Support Vector Machines, a machine learning algorithm. We participated in both Task-A (the single-document summarization task) and Task-B (the multi-document summarization task) of TSC-2.
Similar resources
A survey on Automatic Text Summarization
Text summarization aims to produce a condensed version of a text while preserving its original ideas. The textual content on the web, in particular, is growing at an exponential rate. Sifting through such a massive amount of data to extract the useful information is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Yet Another Summarization System with Two Modules using Empirical Knowledge
We previously proposed a summarization system, GREEN, for Japanese newspaper editorials. However, GREEN is not suitable for summarizing ordinary newspaper articles, which differ from newspaper editorials. To participate in subtasks A-1 and A-2 of TSC (Text Summarization Challenge) in NTCIR-2, we developed a new summarization system from scratch which copes with both ordinary articles and ed...
EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatic text summarization has become an essential tool for web users. In this paper we present a novel approach for creating text summaries. Using fuzzy logic and word-net, our model extracts the most relevant sentences from an original document. The approach utilizes fuzzy measures and inference on the extracted textual information from the docu...
Systematic literature review of fuzzy logic based text summarization
"Information Overload" is not a new term, but the massive development in technology, which enables anytime, anywhere, easy, and unlimited access to, participation in, and publishing of information, has escalated its impact. Assisting users' informational searches with reduced reading and surfing time by extracting and evaluating accurate, authentic, and relevant information are the primary c...
Elimination of Multiple Modifiers in Summarization
We propose a method for summarizing a Japanese sentence. The method aims to produce a natural and readable summary of the sentence. It eliminates some of the multiple adnominal modifiers, including adnominal clauses, by employing two natural language processing tools: KNP (a parser) and JUMAN (a morphological analyzer). With this proposed method, we participated in subtask A-2 (for producin...
Publication date: 2002